Accession Number | TCMCG019C36179 |
gbkey | CDS |
Protein Id | XP_022924550.1 |
Location | complement(join(1743528..1743758,1743926..1744660,1745297..1745596,1745709..1745769,1745874..1746017,1746106..1746362,1746462..1746674,1746925..1747098,1747252..1747319,1751464..1751522,1751708..1751783,1751871..1751976,1752078..1752187,1752368..1752532,1752623..1752708,1752948..1753012,1753240..1753287,1753372..1753491,1753607..1753682,1753826..1753923,1754032..1754179,1755195..1755310)) |
Gene | LOC111432000 |
GeneID | 111432000 |
Organism | Cucurbita moschata |
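
The Location field uses the GenBank feature-location syntax: `join(...)` lists the exon segments as inclusive 1-based `start..end` ranges, and `complement(...)` places the feature on the reverse strand. The following is a minimal sketch, not part of the original record, that parses this string and sums the segment lengths. The helper names `parse_location` and `cds_length` are hypothetical, and the regular expression assumes a simple location with no fuzzy boundaries (`<`, `>`) or remote references.

```python
import re

# Location string copied from the record above, split only for readability.
LOCATION = (
    "complement(join(1743528..1743758,1743926..1744660,1745297..1745596,"
    "1745709..1745769,1745874..1746017,1746106..1746362,1746462..1746674,"
    "1746925..1747098,1747252..1747319,1751464..1751522,1751708..1751783,"
    "1751871..1751976,1752078..1752187,1752368..1752532,1752623..1752708,"
    "1752948..1753012,1753240..1753287,1753372..1753491,1753607..1753682,"
    "1753826..1753923,1754032..1754179,1755195..1755310))"
)


def parse_location(location):
    """Return (strand, segments) for a simple GenBank-style location.

    strand is "-" when the location is wrapped in complement(...), else "+".
    segments is a list of (start, end) tuples, 1-based and inclusive.
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def cds_length(segments):
    """Total number of nucleotides covered by the segments."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total_nt = cds_length(segments)
print(strand, len(segments), total_nt, total_nt // 3 - 1)
```

For this record the segments sum to 3456 nucleotides, that is 1152 codons, which is consistent with the stated protein length of 1151 residues plus one stop codon.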

Length | 1151 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA418582 |
db_source | XM_023068782.1 |

Definition | DNA mismatch repair protein MSH1, mitochondrial isoform X2 [Cucurbita moschata] |
CDS: ATGTACTGGGTGGCTACCCGAAACGTCGTTTCTTTCTCCCGGTGGCGTTTTTTGGCGCTTTTGATTGGCTTCCCTCCGCGCAACTTCACCCCATTTACTCACTCACCGGCGTTTTTAAACAGTGAAAGGCAACAGCTTGAGAAGTTGCAGTTTGGAAAAGGTAGAAAATATTCAGGAGGAAGCATCAAAGCTGCTAAGAAGTTTAAAGATATTAATAATGTCCAAGACGATAAGTTCCTTTCTCACATTTCATGGTGGAAAGAGATGGTGGAATCATGCAAGAAACCGTCGTCGGTTCAGCTGGTTAAGAGGCTTGACTTCTCCAATTTGCTTGGTTTAGATATTAACCTGAAAAATGGGAGTCTTAAAGAAGGAACACTTAACTGGGAGATACTACAGTTCAAGGCAAAGTTTCCTCGAGAAGTTTTGCTTTGTAGAGTTGGAGATTTTTACGAAGCAATTGGAATAGATGCTTGCATACTTGTCGAATATGCTGGTTTGAATCCTTTTGGAGGTCAGCGTATGGATAGCGTTCCGAAAGCTGGTTGCCCTGTTGTGAATCTTCGTCAAACTTTGGATGATCTGACTCGTAACGGGTTCTCAGTGTGCATAGTGGAAGAAGTTCAAGGACCAATGCAAGCTCGTTCTCGCAAAGGACGTTTTATATCTGGGCATGCACACCCGGGCAGTCCCTATGTCTTTGGGCTTGTTGGGGTTGATCATGATCTCGACTTTCCAGAACCAATGCCTGTGGTCGGAATATCTCGATCTGCAAGAGGCTATTGCATAAGCCTTGTGATAGAGACCATGAAGACATATTCGTCAGAGGATGGTTTGACAGAGGAGGCCTTGGTTACTAAGCTGCGCACTTGTCAATACCATCATTTATTTCTTCACACTTCATTAAGGAACAACTCCTCAGGTACTTGTCGCTGGGGTGAATTCGGTGAGGGTGGTCGGCTATGGGGGGAATGTAATTCCAGACATTTCGAGTGGTTCGATGGAAATCCTCTTACTAATCTTTTGTCTAAGGTTAAAGATCTTTATGGTCTTGATGATGAAGTTACATTTAGGAACGTAACGATATCGTCCGAAAATAGGCCACATCCATTAACACTGGGAACTGCAACACAGATTGGTGCCATACCAACAGAGGGAATACCGTGTTTGTTGAAGGTGTTGCTTCCATCAAATTGTGCTGGCCTTCCTGCATTGTATATCAGGGATCTTCTTCTCAATCCTCCTGCTTATGAGATCGCGACCACTATTCAAGCAACATGCAGGCTTATGAGCAATGTCACATGTGCAATTCCAGACTTCACTTGCTTTCCACCCGCCAAGCTCGTGAAGTTACTGGAAATGAGGGAAGCCAATCATATTGAGTTCTGTAGAATGAAGAACGTACTCGACGAAATCTTACACATGCATAAAAATTGCGAGTTAAGCAATATCCTGAAATTGTTGATGGATCCTTCATCTGTGGCAACTGGGTTGAAAATTGACTACGATACATTTGTTGACAAATGTGAATGGGCTTCCAGTAGAGTTGGCGAAATGATTTTTCTCGATAATGAAAGCGAAAGCGATCAGAAAATCAATTCTTATTTTATCATTCCTAATGATTTTTTTGAGGATATGGAATCTTCTTGGAAAGGTCGTGTGAAAAGGATTCACATTGAAGAAGTGTGTACAGAAGTAGAAAGTGCAGCTGAAGCACTGTCTCTAGCAGTTACTGAAGATTTCGTCCCGATCATTTCAAGAATCAAGGCTACTACTGCGCCGCTAGGAGGTCCAAAGGGAGAAATATTGTATGCTCGGGATAATCAATCTGTCTGGTTCAAAGGAAGACGGTTTGCACCAGCTGTATGGGCTGGAAGCCCTGGAGAAGAAGAAATTAAACAATTGAAACCTGCTCTTGATTCAAAGGGTAAAAAGGTCGGGGACGAGTGGTTTACGACGAAGAAGGTGGAAGATGCTTTAACAAGGTACCAAGAG
GCCAATGCCAAAGCAAAAGCAAGAGTAGTGGATTTGCTGAGGCAACTTTCCTCTGAATTGCTTGCTAAAATGAACGTTCTAATATTTGCTTCCATGTTACTCATTATCGCCAAGGCGTTATTCGCTCATGTGAGTGAAGGGAGGAGGAGAAAATGGGTTTTTCCTACCCTTGCTGCACCCAGTGATAGGTCCAAGGGCAGGAAATCAATGGAGGGGAAGGTTGGGATGAAGCTGGTTGGACTATCTCCGTATTGGTTTGATGTGATAGAAGGGAATGCTGTGCAGAATAGTATTGAGATGGAGTCGTTGTTTCTTTTGACGGGTCCAAATGGGGGTGGGAAATCTAGCTTGCTTCGATCCATTTGTGCAGCTGCTTTGCTTGGGATATGTGGATTTATGGTGCCAGCAGAGTCTGCCCTGATTCCTCATTTTGATTCTATTATGCTTCATATGAAATCTTTTGATAGCCCTGCTGATGGGAAAAGTTCTTTTCAGGTGGAAATGTCAGAGATGAGATCCATCATGAGTAGAGCAACGGAAAGCAGCCTCGTACTTATAGATGAAATCTGTCGAGGAACAGAAACAGCAAAAGGCACTTGTATTGCAGGGAGCATTGTTGAAGCTCTTGATAAAGTTGGGTGCCTTGGCATTGTCTCCACTCACTTGCATGGTATATTCAATTTGCCTTTAGATATCAATAACACTGTGTTCAAAGCAATGGGAACTGTGTGTACTGATGGCCGAACGGTTCCCACTTGGAAGTTGATCGGTGGAATATGTAGAGAGAGCCTTGCCTTTGAAACAGCAAAGAATGAAGGAATCTGTGAAGCTATAATTCATAGGGCCCAAGATTTGTATCTCTCGAATTATGTTGAACAAGGGATTTCAGGAAAACAGAAGATGAATTTGTATCCCTCAAATTCTTCTCATGCAAGGCTTAATGGCAATGACAAACCCCATCTCCTGTCAAATGGTGTTACAGTAGAAGCTGAACGCCCAAAAACAGAGAAAACTAAGAAAAAGGTTGTCTCTTGGAAGGAAATTGAGGGTGCTATCACTGCAATATGCCAAAAGAAGCTGATAGAGTTTCATAAGGATAAAAACACATTGAAACCTGCAGAAATCCAATGTGTTTTGATTGATGCTAGAGAGAAGCCACCTCCATCCACAGTCGGTGCTTCGAGTGTGTATGTAATTCTTAGACCAGATGGTAAATTCTACGTCGGACAGACTGATGATCTAGAGGGTCGAGTCCATTCACATCGTTTAAAAGAAGGAATGCGGGATGCTGCATTTCTTTATTTTATAGTACCTGGGAAGAGCTTGGCATGCCAGCTTGAAACTCTTCTCATCAATCGACTTCCTGATCACGGGTTACAGCTAACTAATGTTGCTGATGGAAAGCACCGAAATTTTGGCACATCCAATCTCTTATCAGAGAATGTGACTGTTTGTTCATAA |
Protein: MYWVATRNVVSFSRWRFLALLIGFPPRNFTPFTHSPAFLNSERQQLEKLQFGKGRKYSGGSIKAAKKFKDINNVQDDKFLSHISWWKEMVESCKKPSSVQLVKRLDFSNLLGLDINLKNGSLKEGTLNWEILQFKAKFPREVLLCRVGDFYEAIGIDACILVEYAGLNPFGGQRMDSVPKAGCPVVNLRQTLDDLTRNGFSVCIVEEVQGPMQARSRKGRFISGHAHPGSPYVFGLVGVDHDLDFPEPMPVVGISRSARGYCISLVIETMKTYSSEDGLTEEALVTKLRTCQYHHLFLHTSLRNNSSGTCRWGEFGEGGRLWGECNSRHFEWFDGNPLTNLLSKVKDLYGLDDEVTFRNVTISSENRPHPLTLGTATQIGAIPTEGIPCLLKVLLPSNCAGLPALYIRDLLLNPPAYEIATTIQATCRLMSNVTCAIPDFTCFPPAKLVKLLEMREANHIEFCRMKNVLDEILHMHKNCELSNILKLLMDPSSVATGLKIDYDTFVDKCEWASSRVGEMIFLDNESESDQKINSYFIIPNDFFEDMESSWKGRVKRIHIEEVCTEVESAAEALSLAVTEDFVPIISRIKATTAPLGGPKGEILYARDNQSVWFKGRRFAPAVWAGSPGEEEIKQLKPALDSKGKKVGDEWFTTKKVEDALTRYQEANAKAKARVVDLLRQLSSELLAKMNVLIFASMLLIIAKALFAHVSEGRRRKWVFPTLAAPSDRSKGRKSMEGKVGMKLVGLSPYWFDVIEGNAVQNSIEMESLFLLTGPNGGGKSSLLRSICAAALLGICGFMVPAESALIPHFDSIMLHMKSFDSPADGKSSFQVEMSEMRSIMSRATESSLVLIDEICRGTETAKGTCIAGSIVEALDKVGCLGIVSTHLHGIFNLPLDINNTVFKAMGTVCTDGRTVPTWKLIGGICRESLAFETAKNEGICEAIIHRAQDLYLSNYVEQGISGKQKMNLYPSNSSHARLNGNDKPHLLSNGVTVEAERPKTEKTKKKVVSWKEIEGAITAICQKKLIEFHKDKNTLKPAEIQCVLIDAREKPPPSTVGASSVYVILRPDGKFYVGQTDDLEGRVHSHRLKEGMRDAAFLYFIVPGKSLACQLETLLINRLPDHGLQLTNVADGKHRNFGTSNLLSENVTVCS |
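
The CDS and Protein fields are related by codon translation. The sketch below shows one way to check that relationship on a short prefix and suffix of the sequences above. It assumes the standard genetic code (NCBI translation table 1), which the record itself does not state; the function name `translate` is hypothetical and is not a call from any particular library.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), laid out in
# TCAG order with the third base varying fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above, and its last four codons.
print(translate("ATGTACTGGGTGGCTACCCGAAACGTCGTT"))
print(translate("GTTTGTTCATAA"))
```

The first call reproduces the opening residues of the Protein field (`MYWVATRNVV`), and the second reproduces its final three residues (`VCS`) before the terminal `TAA` stop codon. Passing the full CDS string to the same function would allow a complete comparison against the Protein field.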